Statistical Learning

Probability Distribution

  • Probability that a random variable (r.v.) X takes each possible value:
$P(X = x_i) = p(x_i), \quad i \in \{1, \dots, n\}$
  • Satisfying:
$\sum_{i=1}^{n} p(x_i) = 1, \quad p(x_i) \geq 0, \quad i \in \{1, \dots, n\}$

Discrete Random Variable

Probability Mass Function (PMF)

  • The Probability Mass Function (PMF) gives the probability that r.v. X takes the value $x_i$:

$p(x_i) = P(X = x_i)$

Bernoulli Distribution

  • In a trial, event A happens with probability $\mu$ and does not happen with probability $1 - \mu$.

  • If r.v. X indicates the number of occurrences of event A, then X can be 0 or 1, and its distribution is:

$p(x) = \mu^x (1-\mu)^{1-x}, \quad x \in \{0, 1\}$
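A minimal sketch of this PMF in Python; the value of $\mu$ is illustrative:

```python
# Bernoulli PMF: p(x) = mu^x * (1 - mu)^(1 - x) for x in {0, 1}.
def bernoulli_pmf(x, mu):
    """Probability that X = x, where mu is the success probability."""
    return mu ** x * (1 - mu) ** (1 - x)

mu = 0.3  # illustrative success probability
print(bernoulli_pmf(1, mu))  # probability that A occurs
print(bernoulli_pmf(0, mu))  # probability that A does not occur
```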

Binomial Distribution

  • In n independent Bernoulli trials, if r.v. X represents the number of occurrences of event A, then X takes values in $\{0, \dots, n\}$, with distribution:
$p(X = k) = \binom{n}{k} \mu^k (1-\mu)^{n-k}, \quad k = 0, 1, \dots, n$
  • The binomial coefficient $\binom{n}{k}$ counts the number of ways to choose k elements out of n elements regardless of their order.
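The formula above can be sketched directly with `math.comb`; n and $\mu$ here are illustrative:

```python
from math import comb

# Binomial PMF: p(X = k) = C(n, k) * mu^k * (1 - mu)^(n - k).
def binomial_pmf(k, n, mu):
    """Probability of exactly k occurrences of A in n Bernoulli trials."""
    return comb(n, k) * mu ** k * (1 - mu) ** (n - k)

n, mu = 10, 0.5  # illustrative parameters
# The probabilities over k = 0, ..., n must sum to 1.
total = sum(binomial_pmf(k, n, mu) for k in range(n + 1))
print(total)
```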

Continuous Random Variable

Probability Density Function (PDF)

The probability distribution of a continuous r.v. can be described by the Probability Density Function (PDF) f(x), satisfying:

$\int_{-\infty}^{+\infty} f(x)\,dx = 1, \quad f(x) \geq 0$

Cumulative Distribution Function (CDF)

The Cumulative Distribution Function (CDF) is the probability that the value of r.v. X is less than or equal to x:

$F(x) = P(X \leq x)$
  • For a continuous r.v., we have:
$F(x) = \int_{-\infty}^{x} f(t)\,dt$
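The integral defining F(x) can be approximated numerically with a Riemann sum. This sketch uses an illustrative density, $f(t) = 2t$ on $[0, 1]$ and zero elsewhere, for which $F(x) = x^2$ on that interval:

```python
# Approximate F(x) = integral of f from lo to x with a left Riemann sum.
def cdf(x, f, lo=0.0, steps=200_000):
    if x <= lo:
        return 0.0
    dt = (x - lo) / steps
    return sum(f(lo + i * dt) for i in range(steps)) * dt

# Illustrative density: f(t) = 2t on [0, 1], zero elsewhere, so F(x) = x^2.
f = lambda t: 2.0 * t if 0.0 <= t <= 1.0 else 0.0

print(cdf(0.5, f))  # should be close to 0.5^2 = 0.25
```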

Gaussian Distribution

$X \sim \mathcal{N}(\mu, \sigma^2) \quad \Longleftrightarrow \quad p(x) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$
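A direct transcription of the density above; the default $\mu = 0$, $\sigma = 1$ (the standard normal) is illustrative:

```python
from math import sqrt, pi, exp

# Gaussian density of N(mu, sigma^2).
def gaussian_pdf(x, mu=0.0, sigma=1.0):
    return exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (sqrt(2 * pi) * sigma)

# The density peaks at x = mu and is symmetric around it.
print(gaussian_pdf(0.0))           # 1 / sqrt(2*pi) for the standard normal
print(gaussian_pdf(1.0) == gaussian_pdf(-1.0))
```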

Marginal Distribution

Marginal Probability Mass Function

  • Marginal probability mass function of X:

    $p_X(x_i) = \sum_j p(x_i, y_j)$
  • Marginal probability mass function of Y:

    $p_Y(y_j) = \sum_i p(x_i, y_j)$
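Marginalization amounts to summing the joint table over the other variable. A sketch with an illustrative joint PMF over two binary variables:

```python
# Illustrative joint PMF p(x, y) stored as a dict keyed by (x, y).
joint = {
    (0, 0): 0.1, (0, 1): 0.2,
    (1, 0): 0.3, (1, 1): 0.4,
}

p_X, p_Y = {}, {}
for (x, y), p in joint.items():
    p_X[x] = p_X.get(x, 0.0) + p  # sum over y
    p_Y[y] = p_Y.get(y, 0.0) + p  # sum over x

print(p_X)  # marginal of X
print(p_Y)  # marginal of Y
```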

Marginal Probability Density Function

  • Marginal probability density function of X:

    $f_X(x) = \int_{-\infty}^{+\infty} f(x, y)\,dy$
  • Marginal probability density function of Y:

    $f_Y(y) = \int_{-\infty}^{+\infty} f(x, y)\,dx$

Conditional Probability

For a discrete random vector (X,Y), when X=x is known, the conditional probability of r.v. Y=y is:

$p(y \mid x) = P(Y = y \mid X = x) = \frac{p(x, y)}{p(x)}$
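The ratio of joint to marginal can be computed straight from a joint table; the table here is illustrative:

```python
# Illustrative joint PMF p(x, y) over two binary variables.
joint = {(0, 0): 0.1, (0, 1): 0.2, (1, 0): 0.3, (1, 1): 0.4}

def p_cond(y, x):
    """Conditional PMF p(y | x) = p(x, y) / p(x)."""
    p_x = sum(p for (xi, _), p in joint.items() if xi == x)  # marginal p(x)
    return joint[(x, y)] / p_x

# For each fixed x, the conditional probabilities over y sum to 1.
print(p_cond(0, 1) + p_cond(1, 1))
```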

Sampling

Sampling: given a probability distribution p(x), generate samples that follow it:

$x^{(1)}, x^{(2)}, \dots, x^{(N)} \sim p(x)$

Expectation

Expectation: the probability-weighted average of a random variable.

For discrete r.v. X:

$E[X] = \sum_{n=1}^{N} x_n\, p(x_n)$

For continuous r.v. X:

$E[X] = \int_{-\infty}^{+\infty} x f(x)\,dx$
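The discrete case is a one-line weighted sum; the values and probabilities below are illustrative:

```python
# Expectation of a discrete r.v. as a probability-weighted sum.
values = [1, 2, 3]
probs = [0.2, 0.5, 0.3]  # illustrative PMF; must sum to 1

E = sum(x * p for x, p in zip(values, probs))
print(E)  # 1*0.2 + 2*0.5 + 3*0.3
```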

Law of Large Numbers

When the number of samples is large, the sample mean becomes arbitrarily close to the true mean (the expectation).

Given N independently and identically distributed (I.I.D.) samples

$x^{(1)}, x^{(2)}, \dots, x^{(N)} \sim p(x)$

The sample mean converges to the expected value:

$\bar{X}_N = \frac{1}{N} \sum_{i=1}^{N} x^{(i)} \longrightarrow E[X] \quad \text{as } N \to \infty$
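The convergence can be checked empirically by drawing Bernoulli samples and comparing the sample mean to $E[X] = \mu$; the choices of $\mu$, N, and the seed are illustrative:

```python
import random

random.seed(0)          # fixed seed so the run is reproducible
mu, N = 0.3, 200_000    # illustrative success probability and sample count

# Draw N i.i.d. Bernoulli(mu) samples: x = 1 with probability mu, else 0.
samples = [1 if random.random() < mu else 0 for _ in range(N)]

sample_mean = sum(samples) / N
print(sample_mean)      # close to E[X] = mu for large N
```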